Towards matching user mobility traces in large-scale datasets

نویسندگان

  • Dániel Kondor
  • Behrooz Hashemian
  • Yves-Alexandre de Montjoye
  • Carlo Ratti
چکیده

The problem of unicity and reidentifiability of records in large-scale databases has been studied in different contexts and approaches, with focus on preserving privacy or matching records from different data sources. With an increasing number of service providers nowadays routinely collecting location traces of their users on unprecedented scales, there is a pronounced interest in the possibility of matching records and datasets based on spatial trajectories. Extending previous work on reidentifiability of spatial data and trajectory matching, we now present the first large-scale analysis of user matchability in real mobility datasets on realistic scales, i.e. among two datasets that consist of several million people’s mobility traces for a one week interval each. We extract the relevant statistical properties which influence the matching process and provide an estimate on a performance of matching and thus the matchability of users. We derive that for individuals with typical activity in the transportation system (those making 3-4 trips per day on average), a matching algorithm based on the co-occurrence of their activities is expected to achieve a 16.8% success rate based only on a one-week long observation of their mobility traces. Extrapolating for longer time intervals, we expect a success rate of over 55% after four week long observations. We further evaluate different scenarios of data collection frequency, giving estimates of matchability over time in several realastic cases of mobility datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predict User In-World Activity via Integration of Map Query and Mobility Trace

People often resort to map search engine or other locationbased services for location information when planning long trips or local navigation, and their map queries as well as mobility trace will be accumulated and stored in user log. These data offers valuable information for studying the mechanism of human mobility pattern, furthermore, map query data enable us to sense users’ real-time inte...

متن کامل

You Are How You Move: Linking Multiple User Identities From Massive Mobility Traces

Understanding the linkability of online user identifiers (IDs) is critical to both service providers (for business intelligence) and individual users (for assessing privacy risks). Existing methods are designed to match IDs across two services, but face key challenges of matching multiple services in practice, particularly when users have multiple IDs per service. In this paper, we propose a no...

متن کامل

Analyzing Mobility-Traffic Correlations in Large WLAN Traces: Flutes vs. Cellos

Two major factors affecting mobile network performance are mobility and traffic patterns. Simulations and analytical-based performance evaluations rely on models to approximate factors affecting the network. Hence, the understanding of mobility and traffic is imperative to the effective evaluation and efficient design of future mobile networks. Current models target either mobility or traffic, ...

متن کامل

Characterizing User Behavior and Network Load on a Large-Scale Wireless Mesh Network

Wireless mesh networks represent a promising paradigm to provide a scalable infrastructure for Internet access in metropolitan areas. In this paper, a large-scale wireless mesh testbed deployed in three cities in the Trentino region is described and experimentation results obtained from the public use of the testbed are reported and analyzed. The large-scale of the deployment and high number of...

متن کامل

MobReduce: Reducing State Complexity of Mobility Traces

User traces are essential for analysis of human behavior and development of opportunistic networking protocols and applications. As user traces are collected with high granularity to apply them in diverse scenarios, they have a high complexity resulting from the large number of user states. We present MobReduce: a methodology for reducing the number of states in user traces. We apply MobReduce ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1709.05772  شماره 

صفحات  -

تاریخ انتشار 2017